Goto

Collaborating Authors

 relation constraint






Constrained Information-Theoretic Tripartite Graph Clustering to Identify Semantically Similar Relations

Wang, Chenguang (Peking University) | Song, Yangqiu (University of Illinois at Urbana-Champaign) | Roth, Dan (University of Illinois at Urbana-Champaign) | Wang, Chi (Microsoft Research) | Han, Jiawei (University of Illinois at Urbana-Champaign) | Ji, Heng (Rensselaer Polytechnic Institute) | Zhang, Ming (Peking University)

AAAI Conferences

In knowledge bases or information extraction results, differently expressed relations can be semantically similar (e.g., (X, wrote, Y) and (X,’s written work, Y)). Therefore, grouping semantically similar relations into clusters would facilitate and improve many applications, including knowledge base completion, information extraction, information retrieval, and more. This paper formulates relation clustering as a constrained tripartite graph clustering problem, presents an efficient clustering algorithm and exhibits the advantage of the constrained framework. We introduce several ways that provide side information via must-link and cannot link constraints to improve the clustering results. Different from traditional semi-supervised learning approaches, we propose to use the similarity of relation expressions and the knowledge of entity types to automatically construct the constraints for the algorithm. We show improved relation clustering results on two datasets extracted from human annotated knowledge base (i.e., Freebase) and open information extraction results (i.e., ReVerb data).